A performance-prediction model for PIC applications on clusters of Symmetric MultiProcessors: Validation with hierarchical HPF+OpenMP implementation
نویسندگان
چکیده
A performance-prediction model is presented, which describes different hierarchical workload decomposition strategies for particle in cell (PIC) codes on Clusters of Symmetric MultiProcessors. The devised workload decomposition is hierarchically structured: a higher-level decomposition among the computational nodes, and a lower-level one among the processors of each computational node. Several decomposition strategies are evaluated by means of the prediction model, with respect to the memory occupancy, the parallelization efficiency and the required programming effort. Such strategies have been implemented by integrating the high-level languages High Performance Fortran (at the inter-node stage) and OpenMP (at the intra-node one). The details of these implementations are presented, and the experimental values of parallelization efficiency are compared with the predicted results.
منابع مشابه
Hierarchical MPI+OpenMP Implementation of Parallel PIC Applications on Clusters of Symmetric MultiProcessors
The hierarchical combination of decomposition strategies for the development of parallel Particle-in-cell simulation codes, targeted to hierarchical distributed-shared memory architectures, is discussed in this paper, along with its MPI+OpenMP implementation. Particular emphasis is given to the devised dynamic workload balancing technique.
متن کاملA framework for an automatic hybrid MPI+OpenMP code generation
Clusters of symmetric multiprocessors (SMPs) are the most currently used architecture for large scale applications and combining MPI and OpenMP models is regarded as a suitable programming model for such architectures. But writing efficient MPI+OpenMP programs requires expertise and performance analysis to determine the best number of processes and threads for the optimal execution for a given ...
متن کاملHigh-performance Algorithms and Applications for Smp Clusters
The future of high-performance computing relies on the efficient and scalable use of clusters with symmetric multiprocessor (SMP) nodes and low-latency, high-bandwidth interconnection networks. Current examples of such platforms include Sun Ultra HPC machines, Compaq AlphaServers with Quadrics switches, SGI Origins, and the IBM SP system with SMP nodes. Moreover, the future of NASA missioncriti...
متن کاملInvestigating the Performance of Two Programming Models for Clusters of Commodity Smp
Multiprocessors and high performance networks allow to build CLUsters of MultiProcessors (CLUMPs). A main distinctive feature over traditional parallel computers is their hybrid memory model (message passing between the nodes and shared memory inside the nodes). We evaluate the performance of a cluster of 2-way SMP PCs connected by a Myrinet network for NAS benchmarks from two programming: a Si...
متن کاملA Grid based distributed simulation of Plasma Turbulence
Grid technology is widespreading, but most grid-enabled applications just exploit shared storage resources rather than computational ones, or utilize static remote allocation mechanisms of Grid platforms. In this paper the porting on a Globus equipped platform of a hierarchically distributed-shared memory parallel version of an application for particle-in-cell (PIC) simulation of plasma turbule...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Scientific Programming
دوره 11 شماره
صفحات -
تاریخ انتشار 2003